Case-based estimation of the risk of enterobiasis

نویسندگان

  • Mare Remm
  • Kalle Remm
چکیده

OBJECTIVE To introduce an original case-based machine learning (ML) and prediction system Constud and its application on tabular data for estimation of the risk of enterobiasis among nursery school children in Estonia. METHODS AND MATERIALS The system consists of a software application and a knowledge base of observation data, parameters, and results. The data were obtained from anal swabs for the diagnosis of enterobiasis, from questionnaires for children's parents, observations in nursery schools and interviews with supervisors of the groups. The total number of studied children was 1905. Ten parallel ML processes were conducted to find the best set of weights for features and cases. RESULTS The best goodness-of-fit according to the true skill statistic (TSS) was 0.381. Approximately equal fit can be reached using different sets of features. Cross-validation TSS of logit-regression and classification tree models was <0.24. In addition to the higher prediction fit, Constud is not sensitive to missing values of explanatory variables. The overall prevalence of enterobiasis was 22.8%; the mean of risk estimations was 47.8%. The overestimation of the prevalence in risk calculations can be interpreted as an inefficacy of the single swab analysis, or may be due to the relative constancy of the risk compared to the lability of infection and the applied objective function. CONCLUSIONS In addition to the higher prediction fit, Constud is not sensitive to missing values of explanatory variables. The main risk factors of enterobiasis among nursery school children were the child's age, communication partners, habits, and cleanness of rooms in the nursery school. Mixed age groups at nursery schools also enhance the risk.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Delphi application in solicitation of qualitative risk factors for estimation of a perceived probability of default: Case of Karafarin Bank

Unreliability of financial statements in Iran has urged this country’s financial services industry management to manipulate practices by which they could gain reliable risk scores for borrowers. This research extracts the most influential qualitative factors that would impact the default of a business relationship borrower. Solicitation of the factors is done through Delphi methodology. The mea...

متن کامل

Identification of Hazardous Situations using Kernel Density Estimation Method Based on Time to Collision, Case study: Left-turn on Unsignalized Intersection

The first step in improving traffic safety is identifying hazardous situations. Based on traffic accidents’ data, identifying hazardous situations in roads and the network is possible. However, in small areas such as intersections, especially in maneuvers resolution, identifying hazardous situations is impossible using accident’s data. In this paper, time-to-collision (TTC) as a traffic conflic...

متن کامل

Probabilistic earthquake hazard Analysis with considering Risk-Based concept (Case study of olefin 14)

Background and objective: numerous seismic hazard analysis studies are conducted annually using probabilistic methods throughout the world and Iran, which are usually different from the initial assumptions of analysis or software used. On the other hand, many researches are presented every year about new methods of earthquake hazard zoning, but so far these studies have not computed earthquake ...

متن کامل

Estimation of Value at Risk (VaR) Based On Lévy-GARCH Models: Evidence from Tehran Stock Exchange

This paper aims to estimate the Value-at-Risk (VaR) using GARCH type models with improved return distribution. Value at Risk (VaR) is an essential benchmark for measuring the risk of financial markets quantitatively. The parametric method, historical simulation, and Monte Carlo simulation have been proposed in several financial mathematics and engineering studies to calculate VaR, that each of ...

متن کامل

THE COMPARISON OF TWO METHOD NONPARAMETRIC APPROACH ON SMALL AREA ESTIMATION (CASE: APPROACH WITH KERNEL METHODS AND LOCAL POLYNOMIAL REGRESSION)

Small Area estimation is a technique used to estimate parameters of subpopulations with small sample sizes.  Small area estimation is needed  in obtaining information on a small area, such as sub-district or village.  Generally, in some cases, small area estimation uses parametric modeling.  But in fact, a lot of models have no linear relationship between the small area average and the covariat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Artificial intelligence in medicine

دوره 43 3  شماره 

صفحات  -

تاریخ انتشار 2008